Classification of Documents in E-Learning Using Multidimensional Latent Semantic Analysis
نویسندگان
چکیده
In this paper we consider the problem of dimensionality reduction techniques. Two techniques such as Independent Component analysis (ICA) and multidimensional latent semantic analysis (MDLSA) are proposed. A new document analysis method named multidimensional latent semantic analysis (MDLSA) which resolves the problem of in-depth document analysis, mines local information from a document efficiently with respect to term associations and spatial distributions. The MDLSA first partitions each document into paragraphs and later builds a term ―affinity‖ graph. Each element of this graph represents the frequency of term co-occurrence in a paragraph. We then use Independent Component Analysis (ICA) which finds a linear representation of nongaussian data such that the components are statistically independent. Thus these two techniques are examined in retrieving and classifying the e-learning documents. It is also proven by experimental verifications that the proposed technique outperforms current algorithms with respect to accuracy and computational efficiency. INDEX TERMS Independent component analysis, Multidimensional latent semantic analysis, affinity graph.
منابع مشابه
Query expansion based on relevance feedback and latent semantic analysis
Web search engines are one of the most popular tools on the Internet which are widely-used by expert and novice users. Constructing an adequate query which represents the best specification of users’ information need to the search engine is an important concern of web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...
متن کاملPresentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures
Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...
متن کاملPresentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures
Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...
متن کاملBig Data Categorization for Arabic Text Using Latent Semantic Indexing and Clustering
Documents categorization is an important field in the area of natural language processing. In this paper, we propose using Latent Semantic Indexing (LSI), singular value decomposing (SVD) method, and clustering techniques to group similar unlabeled document into pre-specified number of topics. The generated groups are then categorized using a suitable label. For clustering, we used Expectation–...
متن کاملCapturing the semantic structure of documents using summaries in Supplemented Latent Semantic Analysis
Latent Semantic Analysis (LSA) is a mathematical technique that is used to capture the semantic structure of documents based on correlations among textual elements within them. Summaries of documents contain words that actually contribute towards the concepts of documents. In the present work, summaries are used in LSA along with supplementary information such as document category and domain in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015